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Heisenberg formulated an uncertainty principle stating that there is a tradeoff between noise 
and disturbance when a measurement of position and a measurement of momentum are performed 
sequentially, and another principle imposing a limitation on the product of the uncertainties in a 
joint measurement of position and momentum. Heisenberg provided semiquantitative examples to 



. . ^ support his conjecture, but no proof. Are these formulations of the uncertainty relation sound? 

CN) , Here, we answer this question by analyzing two nondemolition measurements of position and 

T-H ■ momentum and describing the quantum state of the detectors. The former literature neglected the 

' possibility of pre-existing correlations between the two detectors. In case these are initially corre- 

CN \ lated, we prove the Heisenberg noise-disturbance principle not to hold. Furthermore, Ozawa and 

^ ■ Arthurs- Kelly inequalities are also violated. Finally, we show that, by preceding a momentum mea- 

5^ ' surement with a position measurement, the noise cancels out if the two probes are in an appropriate 

Q \ EPR state. 
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an ambiguous way: either a measurement of position 
with an uncertainty ex entails a subsequent disturbance 
on the momentum rip\x such that ex'rip\x ft, or a 
joint determination of position and momentum has un- 
certainties ex^p ^ fi- Furthermore, it is unclear whether 
Heisenberg was referring to the uncertainty e on the 
subensemble of particles for which a given outcome was 
found, or to the uncertainty A over the whole ensemble. 
Soon, Kennard and Weyl [7, 8] proved an inequality for 
the uncertainties of position and momentum, but with 
a fundamental difference from Heisenberg: the Kennard 
inequality axcrp > h/2 refers to the uncertainties ax, crp 
that one would obtain by measuring X ideally (i.e. with 
a detector that introduces no noise) on an ensemble of 
identically prepared particles, and then by measuring P 
on a distinct ensemble that is prepared in the same way 
and that has not undergone the X measurement. 

The noise-disturbance principle, the joint uncertainty 
principle, and the Kennard inequality were confused with 
one another for a long time. A breakthrough came with 
a paper by Arthurs and Kelly [G], who considered a 
joint measurement of position and momentum, assum- 
ing a von Neumann protocol and probes prepared in a 
pure Gaussian state. As Arthurs and Kelly considered 
the system to be initially uncorrelated from the probes, 
not only they found that Heisenberg principle for joint 
measurement held, but that the quantum nature of the 
probes increased the total spread of the outputs, yielding 
AxAp > axcp + > h. Here, a refers to the intrin- 
sic spread of the system before the measurement, and A 
to the spread of the pointer variables of the probes after 
the measurement. We call this relation the Arthurs-Kelly 
inequality. Although with a two-decade delay, the pio- 
neering result of Arthurs and Kelly stimulated further 
investigations on the uncertainty principle for joint mea- 



' Introduction. The questions that we ask (and answer) 
in this Letter are of fundamental importance: Does the 



noise-disturbance principle hold, i.e., is it true that in- 
I creasing the precision in a measurement of position dis- 
^ ' turbs in an uncontrollable way the momentum? How 
O^l do we quantify the disturbance? What happens if we 
measure position and momentum not sequentially, but 
CN ',. jointly? 

I These questions have been investigated theoretically 
' for more than two score years by a minority of physicists, 

■ but recent experiments [1-.3] have brought them to the 
^vq ] attention of the general public. 

■ Here, we apply Bohr's prescription of only asking ques- 
I ] tions that can be answered by experiments, however ide- 

fSJ ' alized these may be. To this end, we consider two non- 
demolition measurements of position and momentum, 
and describe both the probes and the system quantum- 
mechanically. Contrary to previous works, we allow the 
probes to be initially correlated. 

The additional term due to the correlations has sur- 
prising consequences: 

1 ) It makes possible to violate the Heisenberg uncertainty 
principle, in the noise-disturbance formulation [4]. 

2) It voids a recent noise-disturbance relation derived by 
Ozawa [■")]. 

3) It lowers, from to the limit established by 
Arthurs and Kelly [G] for the joint uncertainty principle. 

4) It allows, paradoxically, to counteract the noise intro- 
duced in a momentum measurement by preceding it with 
a position measurement, and vice versa. 
The results presented herein are valid for joint or se- 
quential measurements, for any initial preparation of the 
probes and of the system, and for arbitrary coupling 
strengths. 

Heisenberg [4] formulated the uncertainty principle in 
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surements of noncommuting variables [9-23] . 

The distinction between Heisenberg noise-disturbance 
principle and Kennard inequality became gradually clear 
[24], and it has given rise to a parallel line of research 
[5, 25-31]. 

General theory of quantum measurement. We call mea- 
surement any act of inference, even though in a proba- 
bilistic sense, about a system through the observation of 
a second system, the probe, that has interacted with the 
former. What sets apart the probe from the system is 
its ability to cause a sensation in a human being, if not 
directly, through a chain of amplifications and further 
interactions with the visible electromagnetic field. Here, 
the effect of this von Neumann chain [32] is accounted 
for by considering the readout state of the probe to be a 
mixed state. For clarity, let us suppose that system and 
probe are initially uncorrelated, so that the initial state 
is ppr CSi Psys- The probability of obtaining an outcome p 
from the probe is given by Born's rule 

P{p) = TT{{p'p,{fl) ® l)Uippr ® Psys)U^}, (1) 

where U is the time evolution operator, and Pp^ip) arc 
a family of readout density matrices that account for 
the rest of the von Neumann chain. They must satisfy 
the normalization Jdppp^{p) — 1, with dp a Lebesgues- 
Stieltjes measure, corresponding to a discrete or contin- 
uous distribution of outputs p. Furthermore, it is sen- 
sible to assume that the readout states are classical, i.e. 
[Pp^{pi), p'pj.{p2)] = 0, ypi,p2- This hypothesis allows 
to individuate one, or more, privileged basis, |J), the 
one that diagonalizes at once all p'p^{p). The existence 
of such a basis can be justified by the decoherence ap- 
proach [33, 34]. The Pp^ip) are labeled by the average 
of the operator J = JdJJ | J)(J| and they have a spread 
S'{p) in J, i.e., 

P = TypriJp'prifJ-)}, (2) 

6'\p) - TrprU'p'pM} - (Ti>{Jp;,(Ai)})' . (3) 

The spread S'{p) is but the resolution of the probe, and 
in principle it can be different for different outputs. 
For instance, wc could choose 

p'pM^ dJ\J){J\ (4) 

and let the readout p take discrete values pn spaced by 
{S'„ +6'^^i)/2 from one another (then dp is a distribution 
formed by a sum of Dirac deltas), or we could choose 

-(^ /•+°;, cxp[-(J-M)V2J'^] 

and let p CzR. In general, wc can write 

p'p,{p)^ fdJp{p\J)\J){J\, (6) 



with p(p\J) a probability distribution for p. Assuming 
a uniform resolution d'{p) = S' , Eq. (2) is satisfied for 
p{p\J) ~ f{p — J), where f{p) is a probability distribu- 
tion having zero average and spread 6' . 

The conditional state of the system for given p is 

Psys\^ ^ P{f^y^^rpr{{p'p,{p)<E>l)U{ppr<E>Psys)U^. (7) 

Of course, equivalent formulas arise when treating post- 
selected weak measurement [35, 3G], but with the differ- 
ence that U is expanded in a perturbation series and 
the roles of system and probe are reversed. Let us 
introduce the preparation basis [/), the one in which 
Ppr — Jdl w{I)\I){I\. Wc can rewrite the conditional 
state of the system 

^ j dJdIMjj[p)psysMl j{p) 

^^^^1^ SdJdITT,ys{E.Mpsys}' ^ ' 

We defined the generalized operations 

Mjj{p) = Vp{p\J)w{I){J\U\I), (9) 

and the generalized effects Ejj{p) = Mj j{p)Mjj{p). 
Both Mjj{p) and Ejj{p) arc operators on the Hilbcrt 
space of the system alone. If wc do not make the hy- 
pothesis of classical readout, Eq. (9) becomes Mjj{p) = 
y^p{p\J)w{I){p : J| [/[/), where \p : J) is the basis that 
diagonahzes Pp^ip)- 

When both Pp^ip) and ppr are pure states, the mea- 
surement is positive-operator valued [37-42]. In the gen- 
eral case, we can still define positive operators associated 
with the measurement 

E(p) = J dJdIEjj{p). (10) 

The operators E{p) can be inferred from the observed 
probabilities P{p) by changing the preparation psys- Re- 
covering Ejj{p), however, is a nontrivial task. If Ejj{p) 
arc known, the operations are determined modulo a uni- 
tary operator: Mjj(p) = Vjj{p)Eyj (p), where Ej^j is 
univocally defined as a positive operator. The additional 
operations Vjj{p) represent an unwanted and avoidable 
feedback on the system. Ideally, V,jj{p) = 1. Only by 
making subsequent measurements on the system, is it 
possible to check whether the measurement process is in- 
troducing this unwanted feedback. 

Definitions. We specialize the previous general result 
to a measurement of position and momentum, either se- 
quential or joint. The probe now consists in two probes, 
one measuring X the other K. We label each probe with 
the letter corresponding to the variable that it measures, 
and let A G {X, K} the generic label. Each probe inter- 
acts with the system through a potential Ha that com- 
mutes with the variable A. Furthermore, the interaction 
is assumed instantaneous, for simplicity. Instead of drop- 
ping Planck's constant and declaring that we are working 
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in units of ft = 1, we introduce the wave number K = P/h 
and factor out h in the interaction 



Hint — —h 



\x5{i + e)^xX + \K5{t-e)<^KK . (11 



Here, <&a is the variable of the probe A conjugated to 
J A, satisfying hence [I>a,^a] = i- For e Q~ K is 
measured first, then X, while for e — > 0+ the order is 
exchanged, and finally for e = the measurements are 
joint, a la Arthurs and Kelly [(>]. We shall absorb the 
coupling constants through the canonical scaling <^a ~^ 
Xa^a^ J a JaI^a- Thus, J A has the same dimensions 
as A, and $a has the inverse dimensions. 

We define the initial intrinsic variances of the system 
cr\ = {A'^) — (A)^, the initial variances of the probes 

5\ 



{J\) - {Ja? and 6\ = {^\) - ^^a) 



{J a) = Tr{ J^ppr} , = Tr{$^/5p,.}- and their cross- 
covariances 

e = {^xJk) - (I>x>(Ja-),k = {^kJx) - {^k){Jx). 

(12) 

Notice that the operators in the covariances commute, 
so there is no ambiguity. Recall that Kennard inequal- 
ity establishes that axcFR > 1/2 and 5a5a > 1/2, and 
that the covariances obey the Cauchy-Schwarz inequali- 
ties 1^1 < SxSk and |k| < SkSx- 

For later convenience, we introduce the column vectors 
(indicated by horizontal lists within square brackets) s = 
[x, k], (f) = , j = [jK,jx]- The components of the 
vectors are labeled by the index A^ sk = x and sx ~ k. 
We shall make use of the matrices 
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-1 




"0 



(13) 



The Moyal quasi-characteristic function M {x, k) [4.!] is 
defined as the Fourier transform of the Wigner quasiprob- 
ability W{K, X) [44]. Let Mgysis) the initial Moyal func- 
tion of the system, and Mpr{(j>,j) that of the probes. If 
the probes are initially uncorrelated, then Mpr{(t>,j) = 
Mx{4>x,jx)MK{4>K,jK)- However, we shall consider 
the general case. 

Noise and disturbance: operational definitions. We 
wish to discuss the uncertainty relation in the Heisen- 
berg formulation £aVa'\a ^ '''y where r is a positive real 
number (r ~ 1 in the argumentation of Heisenberg) , is 
the noise in the variable that is measured first, and riA'\A 
is the disturbance introduced in the second measurement 
by the first one. Thus, we need to agree on the definition 
of eA and r]A>\A- For definiteness, say that we measure 
first X, then K. 

We shall consider the noise ex that is introduced by 
the measuring apparatus, not that intrinsic to the sys- 
tem. We can determine ex by calibration: the system is 
prepared in p^ys and it is measured by a reference probe 
that introduces no bias and a known noise . By mea- 
suring the average, we know the intrinsic average (^)o 



of the system, and by estimating total variance of the 
outcome (A^"*)^, we infer the intrinsic uncertainty of the 

system ~ i^x^)'^ ~ (^x"*)^- Then we repeat the mea- 
surement with the probe that we want to characterize on 
an identically prepared system. The readout of the probe 
gives an average {X) and a variance A^,. We define the 
statistical noise = A^ — ct^- and the systematic error 
Dx = (X) - {X)o. The total error £x = (Dj^+ej^)^/^ is 
the sum in quadrature of the two errors. It may be con- 
troversial whether the systematic error would be included 
by Heisenberg in his formulation, or whether the noise to 
consider should be only the statistical one. In any case, 
it is always possible to make Dx zero while leaving ex 
unchanged. For a nondemolition measurement, this op- 
erational definition coincides with that given by Ozawa. 

Notice that in statistics, the statistical noise character- 
izes the precision, and the systematic error the accuracy. 
Furthermore ex represents the uncertainty in each indi- 
vidual measurement, that usually is the predictive uncer- 
tainty [17], while Ax the ensemble uncertainty [24]. 

We define the disturbance in an analogous way as we 
defined the noise. First, we prepare an ensemble of parti- 
cles in the state Psys, skip the X measurement {Xx = 0), 
and measure K. We find an average (K) with a statistical 
spread Ak- Then we repeat the procedure, but precede 
the i^T-measurement with an X-measurement. We find an 
average {K)\x with a statistical spread A^|x Here, we 
shall not condition the spread on a specific outcome Xq of 
the first measurement, even though we could. We define 
the statistical disturbance rj^^x ~ '^'k\x ~ ^/f- Notice 
that, a priori, rj'^ may be negative. A purely imaginary 
value for ry corresponds to the case where the "distur- 
bance" actually decreases the uncertainty, so that it is 
not a disturbance at all. We also define the systematic 
disturbance D^^x = {K)\x ^ i^)- Again, it may be 
controversial whether this second term represents a dis- 
turbance as meant by Heisenberg. Nonetheless, Dx\x 
can be made zero without changing r]x\x- The total 
disturbance is the sum, in quadrature, of the statistical 
and systematic disturbances 'Dx\x = ~^ ^\:\x)^^^ ■ 

Even for nondemolition measurements, this definition dif- 
fers from the one given by Ozawa, since the latter does 
not describe the second measurement process, and thus 
postulates implicitly that the second probe is making an 
ideal measurement and is uncorrelated to the first one. 

It is important to remark that in principle both the 
noise and the disturbance could depend on the state of 
the system. This dependence has been pointed out as un- 
desirable [45], but we see no problem in this: depending 
how a system is prepared, the effect of a probe on it could 
be more or less pronounced. Nonetheless, in the following 
we shall prove that, for the nondemolition measurements 
considered here, noise and disturbance are properties of 
the probes alone. 

Results. Wc shall use the relation between the final 
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Moyal function for system and probes in terms of the 
initial ones [4G, 47] 

M{s; <t>,j) =Msys{s + (j))Mpr {(t>,i + 2aos + a^ct)) ■ (14) 

The symbol e G {+, —,0} specifies the order of the mea- 
surements, as in Eq. (11). References [46, 47] assumed 
as readout states p'p^ip) = \J = fi){J = In this 
case, the characteristic function of the readout probabil- 
ity Z((j>) = Jdfj, exp [« J • 0]n(J) is obtained by tracing 
out the system (s = 0) and by putting j = in Eq. (14). 
In a realistic measurement with finite resolution, how- 
ever, p'prip) = / dJ p{^i\J)\J){J\. We recall that for 
uniform resolution p{p\J) = /(/i — J). The results of 
Refs. [46, 47] are then easily generalized thanks to the 
convolution theorem: the characteristic function, i.e. the 
Fourier transform of the joint probability of observing 
/i = (/i_fs-, /ijf) as the output, 

Z{<P)^ j dp.e't'-'f'Pi^i) =j dp.dJ e'" '!' p{p.\J)W{J), (15) 

is obtained by adding a factor z{(j)) = J (i/xexp[i/i-0]/(/i), 
Z{4>) ^z{cp)Ahys{^)Mp,.{^,ae4>). (16) 

For definiteness, we consider e = +. By taking the sec- 
ond derivatives of ln(Z), we obtain the statistical vari- 
ances of the readout 

l^\ = al+5l + 5'l (17) 
^l\x + + ~5\ +51 + 2k. (18) 

The statistical error is e\ — 5\ + 5'^, and, since A|. = 
a\ + 5\ + S'l, the statistical disturbance is 

vl\x = S'x+'2^- (19) 

As we noted earlier, we can make the systematic error 
Dx and disturbance Dj^^x vanish, by applying two ap- 
propriate translations to the initial state of the probe 
(since we are looking for minimal disturbance and error, 
it is sensible to assume that the probe X is not biased). 

Violation of the noise-disturbance principle. For ini- 
tially uncorrelated probes k = 0, so that rj'^^x ~ ^x- 
The Heisenberg noise-disturbance relation becomes then 

exVK\x=6x^Jsj,+5'l>^. (20) 

Thus, if the probes are not correlated, the noise- 
disturbance relation for position and momentum holds, 
in agreement with previous results [25, 27, 45]. 

However, if the probes are correlated, k can be nega- 
tive. The disturbance has the bounds 

~Sx{6x-26k)<t]Iix<5xC6x+26k). (21) 



If 5x < 25k, the lower limit is negative, so that the dis- 
turbance can be arbitrarily small, invalidating Heisen- 
berg's argument. Other possibilities leading to this vi- 
olation explored in the former literature are the EPR 
thought-experiment [48], and Ozawa's model [25]. Let 
us discuss them briefly: in the EPR setup, one particle 
works as a probe, but the two particles are initially cor- 
related. Thus the hypothesis that system and probes are 
initially uncorrelated is violated. Furthermore, due to 
this limitation, the EPR scheme does not work for any 
initial state of the system. On the other hand, Ozawa vi- 
olation relies on a single probe whose coordinate y gives 
information about the momentum p^. of a particle (pos- 
sibly coinciding with the probe) and whose momentum 
Py gives information about x. In this case, one can mea- 
sure either y or Py, but not both, hence it is questionable 
whether Ozawa's model produces a genuine violation of 
the noise-disturbance principle, as noise and disturbance 
refer to different setups. 

Violation of Ozawa relation. According to Ozawa [5], 
the following relation holds 

1 . X 

<^XllK\X + <^xcrK + (^XllK\X > ^- (22) 

However, we have demonstrated that rj^^x can be zero. 
Ozawa's relation then reduces to exO'K > 1/2. This in- 
equality can be violated, since ex and aK arc indepen- 
dent of each other, the first referring to the probe, the 
second to the system. This does not show that Ozawa's 
relation is wrong, as it is a mathematical identity, but 
that the definition of disturbance upon which it relies is 
incompatible with our operational definition, since it dis- 
regards the possibility of correlated probes, and therefore 
it should be discarded. 

Violation of the Arthurs-Kelly limit. Heisenberg also 
formulated the uncertainty principle in terms of the mu- 
tual uncertainty introduced in a joint measurement of 
momentum and position. Thus, let us consider the case 
of joint measurements, e = 0. The variances are 

A'x = <y^x+5jc + 5'i + \5l~^ 

>cTjc + {5x-l~5K? + 5'l, (23) 

AI = <jI+ 5l + 5f, + idl + n 

>crl + {5K-\5xf + 5'l. (24) 

If we interpret Heisenberg's argument as referring to 
the noise introduced by the measurement, then e^^/f — 
{5\+5'^ + \pJi~^){5\+5'l+\Px+^<')^'^i^s, no lower honnd. 
For instance, we could make $k perfectly correlated to 
Jx, then adjust the couplings so that 25 x ~ 5k and let 
the resolution 5'x 0. However, if we, as Arthurs and 
Kelly did [d], interpret Heisenberg's argument as referring 
to the total uncertainty, that intrinsic to the measured 
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system plus the noise introduced by the measurement, 
then Ajf Ajf > 1/2. This value is to be contrasted to the 
one commonly accepted [10] /S.x^k > <^x<^k + 1/2 > 1- 
The reason for the discrepancy lies once again in the im- 
plicit assumption of uncorrelated probes made in the for- 
mer literature. 

Cancelling the noise. Finally, let us go back to the case 
of sequential measurements, and assume perfect anticor- 
relations between $x and , so that k = —5x5k- For 
instance, we could prepare the probes in an EPR state. 
Equation (18) becomes then 



'-^K\X 



'K 



(25) 



By making 6k ~ the contributions of the probes to 
the variance partially cancel out, leaving only the contri- 
bution of the finite resolution, which can be made arbi- 
trarily small, in principle. Remember that we concealed 
the coupling constants by rescaling the variables. Let us 
restore them momentarily, so that the equality we want 
to reach is Xj^Sk ~ ^xSx- This can be realized simply 
by changing Xk^x, without acting on the initial state 
of the probes. The same considerations hold when the 
ordering of the measurements is exchanged. 

Conclusions. We have demonstrated how initial cor- 
relations in the detectors invalidate many results com- 
monly believed to hold, and how this can be exploited to 
reduce the measurement noise. The Moyal formalism al- 
lowed to unify the treatment of joint and sequential mea- 
surements. An interesting question to be investigated is 
whether classical correlations are sufficient to violate the 
Heisenberg, Ozawa, and Arthurs-Kelly limits, or whether 
the detectors should be prepared in an entangled state. 
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